Min-max theorem

From HandWiki
Short description: Variational characterization of eigenvalues of compact Hermitian operators on Hilbert spaces

In linear algebra and functional analysis, the min-max theorem, or variational theorem, or Courant–Fischer–Weyl min-max principle, is a result that gives a variational characterization of eigenvalues of compact Hermitian operators on Hilbert spaces. It can be viewed as the starting point of many results of similar nature.

This article first discusses the finite-dimensional case and its applications before considering compact operators on infinite-dimensional Hilbert spaces. We will see that for compact operators, the proof of the main theorem uses essentially the same idea from the finite-dimensional argument.

In the case that the operator is non-Hermitian, the theorem provides an equivalent characterization of the associated singular values. The min-max theorem can be extended to self-adjoint operators that are bounded below.

Matrices

Let A be a n × n Hermitian matrix. As with many other variational results on eigenvalues, one considers the Rayleigh–Ritz quotient RA : Cn \ {0} → R defined by

RA(x)=(Ax,x)(x,x)

where (⋅, ⋅) denotes the Euclidean inner product on Cn. Equivalently, the Rayleigh–Ritz quotient can be replaced by

f(x)=(Ax,x),x=1.

The Rayleigh quotient of an eigenvector v is its associated eigenvalue λ because RA(v)=(λx,x)/(x,x)=λ. For a Hermitian matrix A, the range of the continuous functions RA(x) and f(x) is a compact interval [a, b] of the real line. The maximum b and the minimum a are the largest and smallest eigenvalue of A, respectively. The min-max theorem is a refinement of this fact.

Min-max theorem

Let A be Hermitian on an inner product space V with dimension n, with spectrum ordered in descending order λ1...λn.

Let v1,...,vn be the corresponding unit-length orthogonal eigenvectors.

Reverse the spectrum ordering, so that ξ1=λn,...,ξn=λ1.

(Poincaré’s inequality) — Let M be a subspace of V with dimension k, then there exists unit vectors x,yM, such that

x,Axλk, and y,Ayξk.

min-max theorem — λk=maxVdim()=kminxx=1x,Ax=minVdim()=nk+1maxxx=1x,Ax

Define the partial trace trV(A) to be the trace of projection of A to V. It is equal to ivi*Avi given an orthonormal basis of V.

Wielandt minimax formula ([1]Template:Pg) — Let 1i1<<ikn be integers. Define a partial flag to be a nested collection V1Vk of subspaces of n such that dim(Vj)=ij for all 1jk.

Define the associated Schubert variety X(V1,,Vk) to be the collection of all k dimensional subspaces W such that dim(WVj)j.

λi1(A)++λik(A)=supV1,,VkinfWX(V1,,Vk)trW(A)

Proof

This has some corollaries:[1]Template:Pg

Extremal partial trace — λ1(A)++λk(A)=supdim(V)=ktrV(A)

ξ1(A)++ξk(A)=infdim(V)=ktrV(A)

Corollary — The sum λ1(A)++λk(A) is a convex function, and ξ1(A)++ξk(A) is concave.

(Schur-Horn inequality) ξ1(A)++ξk(A)ai1,i1++aik,ikλ1(A)++λk(A) for any subset of indices.

Equivalently, this states that the diagonal vector of A is majorized by its eigenspectrum.

Schatten-norm Hölder inequality — Given Hermitian A,B and Hölder pair 1/p+1/q=1, |tr(AB)|ASpBSq

Proof

Counterexample in the non-Hermitian case

Let N be the nilpotent matrix

[0100].

Define the Rayleigh quotient RN(x) exactly as above in the Hermitian case. Then it is easy to see that the only eigenvalue of N is zero, while the maximum value of the Rayleigh quotient is 1/2. That is, the maximum value of the Rayleigh quotient is larger than the maximum eigenvalue.

Applications

Min-max principle for singular values

The singular values {σk} of a square matrix M are the square roots of the eigenvalues of M*M (equivalently MM*). An immediate consequence of the first equality in the min-max theorem is:

σk=maxS:dim(S)=kminxS,x=1(M*Mx,x)12=maxS:dim(S)=kminxS,x=1Mx.

Similarly,

σk=minS:dim(S)=nk+1maxxS,x=1Mx.

Here σk denotes the kth entry in the decreasing sequence of the singular values, so that σ1σ2.

Cauchy interlacing theorem

Let A be a symmetric n × n matrix. The m × m matrix B, where mn, is called a compression of A if there exists an orthogonal projection P onto a subspace of dimension m such that PAP* = B. The Cauchy interlacing theorem states:

Theorem. If the eigenvalues of A are α1 ≤ ... ≤ αn, and those of B are β1 ≤ ... ≤ βj ≤ ... ≤ βm, then for all jm,
αjβjαnm+j.

This can be proven using the min-max principle. Let βi have corresponding eigenvector bi and Sj be the j dimensional subspace Sj = span{b1, ..., bj}, then

βj=maxxSj,x=1(Bx,x)=maxxSj,x=1(PAP*x,x)minSjmaxxSj,x=1(A(P*x),P*x)=αj.

According to first part of min-max, αjβj. On the other hand, if we define Smj+1 = span{bj, ..., bm}, then

βj=minxSmj+1,x=1(Bx,x)=minxSmj+1,x=1(PAP*x,x)=minxSmj+1,x=1(A(P*x),P*x)αnm+j,

where the last inequality is given by the second part of min-max.

When nm = 1, we have αjβjαj+1, hence the name interlacing theorem.

Lidskii's inequality

Lidskii inequality — If 1i1<<ikn then λi1(A+B)++λik(A+B)λi1(A)++λik(A)+λ1(B)++λk(B)

λi1(A+B)++λik(A+B)λi1(A)++λik(A)+ξ1(B)++ξk(B)

Proof

Note that iλi(A+B)=tr(A+B)=iλi(A)+λi(B). In other words, λ(A+B)λ(A)λ(B) where means majorization. By the Schur convexity theorem, we then have

p-Wielandt-Hoffman inequality — λ(A+B)λ(A)pBSp where Sp stands for the p-Schatten norm.

Compact operators

Let A be a compact, Hermitian operator on a Hilbert space H. Recall that the spectrum of such an operator (the set of eigenvalues) is a set of real numbers whose only possible cluster point is zero. It is thus convenient to list the positive eigenvalues of A as

λkλ1,

where entries are repeated with multiplicity, as in the matrix case. (To emphasize that the sequence is decreasing, we may write λk=λk.) When H is infinite-dimensional, the above sequence of eigenvalues is necessarily infinite. We now apply the same reasoning as in the matrix case. Letting SkH be a k dimensional subspace, we can obtain the following theorem.

Theorem (Min-Max). Let A be a compact, self-adjoint operator on a Hilbert space H, whose positive eigenvalues are listed in decreasing order ... ≤ λk ≤ ... ≤ λ1. Then:
maxSkminxSk,x=1(Ax,x)=λk,minSk1maxxSk1,x=1(Ax,x)=λk.

A similar pair of equalities hold for negative eigenvalues.

Self-adjoint operators

The min-max theorem also applies to (possibly unbounded) self-adjoint operators.[2][3] Recall the essential spectrum is the spectrum without isolated eigenvalues of finite multiplicity. Sometimes we have some eigenvalues below the essential spectrum, and we would like to approximate the eigenvalues and eigenfunctions.

Theorem (Min-Max). Let A be self-adjoint, and let E1E2E3 be the eigenvalues of A below the essential spectrum. Then

En=minψ1,,ψnmax{ψ,Aψ:ψspan(ψ1,,ψn),ψ=1}.

If we only have N eigenvalues and hence run out of eigenvalues, then we let En:=infσess(A) (the bottom of the essential spectrum) for n>N, and the above statement holds after replacing min-max with inf-sup.

Theorem (Max-Min). Let A be self-adjoint, and let E1E2E3 be the eigenvalues of A below the essential spectrum. Then

En=maxψ1,,ψn1min{ψ,Aψ:ψψ1,,ψn1,ψ=1}.

If we only have N eigenvalues and hence run out of eigenvalues, then we let En:=infσess(A) (the bottom of the essential spectrum) for n > N, and the above statement holds after replacing max-min with sup-inf.

The proofs[2][3] use the following results about self-adjoint operators:

Theorem. Let A be self-adjoint. Then (AE)0 for E if and only if σ(A)[E,).[2]: 77 
Theorem. If A is self-adjoint, then

infσ(A)=infψ𝔇(A),ψ=1ψ,Aψ

and

supσ(A)=supψ𝔇(A),ψ=1ψ,Aψ.[2]: 77 

See also

References

  1. 1.0 1.1 Tao, Terence (2012). Topics in random matrix theory. Graduate studies in mathematics. Providence, R.I: American Mathematical Society. ISBN 978-0-8218-7430-1. 
  2. 2.0 2.1 2.2 2.3 G. Teschl, Mathematical Methods in Quantum Mechanics (GSM 99) https://www.mat.univie.ac.at/~gerald/ftp/book-schroe/schroe.pdf
  3. 3.0 3.1 Lieb; Loss (2001). Analysis. GSM. 14 (2nd ed.). Providence: American Mathematical Society. ISBN 0-8218-2783-9. 
  • Fisk, Steve (2005). "A very short proof of Cauchy's interlace theorem for eigenvalues of Hermitian matrices". arXiv:math/0502408.
  • Hwang, Suk-Geun (2004). "Cauchy's Interlace Theorem for Eigenvalues of Hermitian Matrices". The American Mathematical Monthly 111 (2): 157–159. doi:10.2307/4145217. https://www.jstor.org/stable/4145217. 
  • Kline, Jeffery (2020). "Bordered Hermitian matrices and sums of the Möbius function". Linear Algebra and Its Applications 588: 224–237. doi:10.1016/j.laa.2019.12.004. 
  • Reed, Michael; Simon, Barry (1978). Methods of Modern Mathematical Physics IV: Analysis of Operators. Academic Press. ISBN 978-0-08-057045-7. https://www.elsevier.com/books/iv-analysis-of-operators/reed/978-0-08-057045-7. 
  • Edmunds, D. E.; Evans, W. D. (2018). "11.1 The Max–Min Principle for Semi-Bounded, Self-Adjoint Operators". Spectral theory and differential operators. Oxford science publications (2 ed.). Oxford: Oxford University Press. ISBN 978-0-19-881205-0. https://academic.oup.com/book/26643/chapter/195374842. 

Template:Analysis in topological vector spaces Template:Spectral theory